# Cross-modal conversion
Wan2.1 T2V 14B FusionX VACE GGUF
Apache-2.0
A GGUF-quantized text-to-video model, converted from its base model, that supports a range of video generation tasks.
Text-to-Video English
QuantStack
461
3
Seamless M4T v2 Large
SeamlessM4T v2 is a large multilingual, multimodal machine translation model released by Facebook. It supports speech and text translation for nearly 100 languages.
Text-to-Audio
Transformers Supports Multiple Languages

facebook
64.59k
821
Featured Recommended AI Models